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ABST RACT 


| A branch and bound technique is used to derive an algorithm for 
computing the alpha-width of any matrix of zeros and ones. Through 
computation of the lewidth of over 200 matrices of various dimensions, 
it is found that less than 20 minutes of computation time on the 
Control Data 160); digital computer is required to complete the compu- 
tation for most matrices. Applications of the algorithm to integer 
programming and to various targeting problems are described. Exten-~ 
sions are suggested for computing the minimal cost alpha-width, and 


for computing a minimal C~cover. 
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ies A Targeting Problem. 


Consider the following rather specialized targeting problem: a 
communications network is given (Fig. 1), in which stations can 
communicate directly only with those stations to which they are con- 
nected by a link. Of course, this would be the case with any kind 
of land line network, but it is possible also, in the case of UHF 
radio communications, micro-wave relay systems, and even signal light. 
We ask this question: what is the minimum number of stations that 
must be destroyed so that the network is totally disrupted; that is, 
so that no pair of surviving stations can comminicate? 

The answer is given in Figure 23 in which those stations targeted 
have been crossed out. It is perhaps surprising to note that the 
station most central to the network; the one directly connected to 
the greatest number of stations, is not targeted. In fact, if this 
station were included in the target list, we should be forced to 
target the four indicated targets anyway, and thus we would have been 
forced away from the optimal solution. 

Let us note, parenthetically, that no claim is made that our 
targeting policy is the best one. It is quite probably valid, and 
indeed optimal, if the purpose of the attack is, for example, the 
total (and temporary) disruption of an enemy's warning system for 
the protection of a second strike to follow immediately. But assume 
that the network is a railroad system. It is quite possible that a 
policy of bombing junctions, switching yards, and accessible rail 
lines would have little lasting effect on the effectiveness of the 


transportation system because of the ready availability of repair 








Figure 1 








equipment and personnel. For example, in the interdiction of the 
French railroads prior to the invasion of Normandy in World War ITI; 
our bombing of marshalling yards and other junctions caused little 
disruption of rail traffic, although it did strain the repair capa- 
bilities of the rail system severely. On the other hand, when 
bridges over the Seine, Oise, and Meuse Rivers were added to the 
target list, results were spectacular. On 26 May, all routes over 
the Seine north of Paris were closed to rail traffic and remained 
closed for the next thirty days. By contrast, marshalling yards 
could be repaired in one or two days. (See pp 217-230; and 
especially p 228 of [6]). 

No matter, this simple problem will serve to illustrate the 
very general algorithm to be described in section 33 without re- 
quiring that cumbersome set-up procedures be learned before getting 
down to!work. 

The solution to this targeting problem was obtained without 
difficulty after the initial error of trying to include the central 
station (number 3) in the target list, merely by inspection of the 
network layout. It is unfortunate that so few communications net- 
works of nine stations and eight connecting links are of interest 
in a problem of this type. Clearly, if a network of interesting 
size were examined (let us say on the order of 15 stations and 35 
connecting links), the solution by inspection would be quite diffi- 
cult. Where, then, are we to look for a method of attack on this 
problem? 

It is well known from the theory of graphs, that every graph 


may be represented by an incidence matrix of zeros and ones; in fact 
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by any of several incidence matrices depending upon the purpose for 
the representation. [1]. For the purpose of this paper we will use 
the following terminology from graph theory: a node of a graph is 
the junction of two or more links of the graph (synonym: vertex); 
an arc is a link between two nodes, and in this paper will be con- 
sidered to be without direction. We define the node-arc incidence 
matrix, A, of a graph, by construction as follows: List the nodes 
of the graph horizontally and the arcs vertically so that they are 


labels of columns and rows of the matrix, respectively. If the yeh 


h 


node is a terminal point of the ig arc, set aij «= 1. Otherwise, 


set aa = 0. The node-arc incidence matrix of the communications 
network of Figure 1 is displayed in Figure 3. 

The targeting problem restated in graph theoretic terms iss 
Find the minimum number of nodes so that each arc of the graph has 
at least one of the nodes as a terminal. Since we already know the 
answer to this simple problem, it would be well to examine this 
solution applied to the node-arc incidence matrix. We construct a 
new matrix from the incidence matrix by including only those columns 
labelled with one of the nodes in the solution set. This matrix is 
displayed in Figure . It contains the same number of rows as the 
original matrix, but has only four columns. We note that whereas 
there were two "1"'ts in each of the rows of the incidence matrix 
(one for each terminal of each arc); there is only one "1" in the 
sub-matrix. 

A little reflection upon the above observation leads to a third 
formulation of the targeting problems given the node-arc incidence 


matrix of the communications network, find the smallest subset of 
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columns of the matrix with the property that each row is represented 

by at least one "1" in this subset of columns. But this smallest 

subset of columns is precisely what Fulkerson and Ryser call a 

minimal set of representatives for the (0,1) matrix, A3 and the 
cardinality of this set is called the "width" of A. [lh]. The problem 
may be generalized: we require that each row of the matrix be 
represented by at least alpha "l'"t's (where alpha is a positive integer). 
We shall use the terminology "minimal o-set of representatives for 

the (0,1) matrix, A"; and "o-width of A". This terminology is due 

also to Fulkerson and Ryser. [bh]. 

Thus the simple targeting problem may be solved by finding the 
l-width of the node-arc incidence matrix of the communications network. 
It is the purpose of this paper to present an algorithm for finding 
the a-width of any (0,1) matrix; and for specifying at least one 
minimal o-set of representatives for that matrix. Since we already 
have solved one problem of this type, we shall use this communications 
network and its associated incidence matrix for illustrative purposes 
throughout the balance of this paper. 

We now state the general problem which we desire to solves given 
a finite set, X, and a class, Y, of k non-empty subsets of X (but not 
necessarily the class of all non-empty subsets of X), find a sub- 
class, Z, of Y, with the property that if x € X, then x is a member 
of at least a of the members of Z. This is a quite general problem, 
as will be shown in later sections of this paper. Any problem which 
can be formulated in the terms specified in this paragraph is capable 
of being solved by the algorithm to be presented. The incidence 


matrix for this abstract problem is constructed by listing members of 





X vertically and subsets of X horizontally. Then we place a "1" 


h th 


in the yth row and AL column if the i~ member of X is a member 


of the ngs subset of X. 


2. The Class, ARS re 


Let A denote the (0,1) matrix of size m by n3 that is, Aisa 
matrix with m rows and n columns, each of whose elements is either 
zero or one. Let the sum of all of the elements of the it? row be 
denoted by r;3 and the sum of all of the elements in the yeh column 


be denoted by Si. That is: 


n 

(1) ~s Bij =; (fel, ..., m) 
js} 
m 

(2) ar = S (ieie 2, «<s, M) 


m n 
We note that “ a yy Sj 

i=] j=l 
We call the column vector, (r,, Poy cess =) = R, the row sum vector; 
and the row vector, (s,, Sy, e205 s) = S, the column sum vector. We 
denote by 2(R,S) the class of all (0,1) matrices of size m by n with 
row sum vector, and column sum vector, R and S, respectively. 

From the class, ‘U(R,S), many very interesting combinatorial 
results may be obtained. An excellent survey of this material may be 
found in Ryser.{10]. We will be concerned primarily with a parameter, 
a or (a), of the class, which is defined as the greatest lower 
bound on the a-width of any matrix in WU(R,S). That is, € is the 
Qewidth of the matrix in AL(R,S) which has the smallest o-width of any 


matrix in the class. 





Although not of concern until a later section, it will be of 
interest to determine under what conditions the class, (RS) is non- 
empty. Let 6, = (1, 1, »«6, 1, 1, 0, 0, «2s, 0) be @h n-dimensional 
vector with the first r. components equal to one, and the remaining 
components equal to zero. We then define a matrix of the forn, 


nh=- Tj 


A= Sy 


called the maximal matrix with row sum vector, R. It has column sum 
m — 


vector, S = (S15 Soy ese5 S n)° Now since \ ree >. Sa 10m 
iel et J 

fixed, s is unique, by definition of the rr and the class WU(R,S), 

by a simple contradiction argument, has only one members; namely, A. 

Let Q = (q)5 do, +++, q,) and Qe = (GF, Gs sees a) be any two 
k-dimensional vectors whose components are non-negative integers. We 
say that Q is majorized by Qt, denoted Q < Q*, provided that with sub- 

> ° 
scripts renumbered so that q, = do Zeer = V3 and ry = qs ae eae qT» 


the following statements are true: 


3) tater ta, 


IA 
ok 
+ 
ns 


eco + a, (j#1,2,.+-,k=l) 


(hy) a4 i deed + ee 


rk 


We say that Q is normalized if q, = Io a qi. These two definitions 
now enable us to give conditions under which LICRES) is non-empty. 
Theorem 2.1 


Let R = (ry; re See rs and § = (s,, eee s) be two 


22 2° 


normalized vectors whose components are non-negative integers, and 


such that ya i ef : Let A be the maximal matrix of size m 


—=- 


1 
by n, with row sum - vbtton R, and column sum vector §S = ie — eee i 


8 





Then a necessary and sufficient condition that 4(R,S) be non-empty 


is that S <~ S. 
n n m 


Proof: Assume SKS. Since ». =F id > S, = >. EG tt 
jel jer 2 ixl i 


must be that SA S because equation (3), above, is violated, that is, 


for some k, it must be the case that S, + Sy + e+e + S. > Sy + S, 
tas $2 But then A is not maximal, since the first k columns of 
A contain more "1"'s than the first k columns of A. The hypothesis 
is that A is maximal, so we have arrived at a contradiction, thus 
demonstrating the necessity of the theorem. 

To show sufficiency; we shall construct a matrix, Ac YU(R,S) 
from the maxtrix A. This construction is due to Ryser. [9] . The 
construction will proceed by shifting ones in the gbh row of A to 
other positions in the same row. We note again, that R, S, and S are 
all normalized, and that S < S. iby o. ae rearrange the ones in the 
rows of A so that only S, ones remain in the first column. We may do 


this unless S; ? Ss) (Jee, .2.5 Mf), inewhich case, S + Sy + oe + 


> é _ = Pe ne ° "I 
S, 7 MSy SS) + Sot -e- + SF Sy + ... +S 3 an absurdity. We 
continue by induction. Suppose that the first t columns of A have 
been rearranged. The matrix thus far constructed has the forn, 
t «= 
At =[by by + + +b, Diggs © - b, | 

where there are S; ones in the jh column of A! (j m=, :.., t). 

We now construct the (t+1)St column. Let the number of ones in 
the ig column be (j = t+#l, ..., n).» We may construct A! without 
loss of generality such that, Sie aie =s. Now it is possible 


that either Sie. < Stayt or that Stay? State We consider each case 


in turns 





Case Iz Sta, = Stay 

Remove ones from the (t41)5© column, placing them in other 
columns to the right. If sufficiently many ones may be removed by this 
procedure, the column of A is constructed, and we are finished. Sup- 
pose therefore, that there remain, d ones in column t+#l, so that 
St41 < d Sst, ,- Let the matrix at this stage be denoted by oan 
Now if d > s,,,, then for every @r.t+1 13; we must have er} = 
(j = t#2,..., mn). Hence s:4) + ... + S, must at least equal d-(n-t). 
But Si41 “ ae Si 4.9 = S441 < d3 etc., so that 

Stay tooo Sq < (n-t)ed S sy, +... + 8) 

an absurdity. 
Case Il: s,,, 7 Sta) > 

Insert ones in the (a5) oe column from columns to the right. If 
sufficiently many ones can be inserted, we are finished. We therefore 
assume that sufficiently many ones cannot be inserted by this procedure; 
in fact, we assume that column t+l contains only d ones such that 
Ste. =A < Sey, - Again, let the matrix at this stage of construction 
be denoted by [e,. J. Then if er tel = 0, it must be the case that 
@n3 O (j = t#l, ..., n). Now suppose that ea = 1 for some j = t+2. 
Then either Cok * 1 for all kK Stl, or els®, for some k= t, Cok = 0. 
Consider the case in which Cok * O. Since s, =Si4, > d, there must 
O. We interchange e 


exist ek = 1, and e and Cons and also 


q,t+1 ’ qj 


interchange Cok and e This increases the value of d by one 


Dette: 
without changing the value of any column sum for columns to the left 
of column t+l. Suppose we make all such interchanges and still, 

d < S4,- This situation includes the case mentioned above, that 

ek =] forall k =t+l. It is no longer possible to shift ones from 


Columns CF2, s<8, MS tnto columns 1, ..., t#l. This must. mean that 
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either all of the ones for a given row are in columns to the left of 
column t+23 or that all of elements of a given row to the left of 
column t+2 are equal to one. In either case, it must be that, 

S7 +t oes + S + d= $1 gee S; + St41 
But then, since S <s, 

Sj + oe + 84) SS, + oes Siu, (= 8) + 0 + 8, + ) 
whence S;44 = ds; contrary to the assumption. QED 

We now consider an extension of the concept of G-width. Let 
Cz (c,, sGueis Cc.) be an m-dimensional vector of non-negative integers. 
We wish to find the smallest subset of columns of A ec “L{(R,S) such 
that the ith row of A is represented by at least c; ones in this sub- 
set of columns. Such a subset of columns will be called a minimal C- 
cover for A, and we shall denote its cardinality by e€(C), called the 
Cawidth of A. Clearly, if Cre (a, ..«; a), then €(C) = €(a). We 
define 4(C) to be the greatest lower bound on the C-width of any 
matrix in %U(R,S). (C) can be estimated by o(C) as follows: 

(S) o(C) = the smallest integer such that Ya = yy - 


We shall use this formula in the algorithm to be presented in section 


™ 
C.; 


three. Note that if C = (a, -.., a): then p(C) = the smallest integer 
fe) 

such that 2 Se a ea 
ja > 


3. Derivation of the Algorithm. 


We shall now describe our algorithm for finding the o-width of a 
(0,1) matrix. The branch and bound technique was suggested to me by 
D. R. Fulkerson of the RAND Corporation, Santa Monica, California, and 
is patterned after the branch and bound solution to the travelling 


salesman problem designed by Little, et alii. [7] 


Et 





We have several techniques for estimating e(a)(which we shall 
henceforth call the g-width of the class, AS(R,S)). One such tech- 
nique is described in the preceding section, in which we compute the 
parameter, ©. Now a given (0,1) matrix of size m by n, is a member of 
a class, 2J{(R,S). We can partition the class, U(R,S) into two sub- 
classes, one consisting of those matrices which have a selected column, 
say column p, as a member of a minimal a=-set; and the other consisting 
of those matrices for which column p is not a member of any minimal 
a-set for the matrix. 

We thus have two sub=classes, each of which has no more members 
than the original class, and we know that the original matrix must be 
in one, and only one of the sub-classes. Consider the sub-class whose 
matrices have column p as a member of a minimal a-set. We may use 
this information to reduce the dimensions of all the matrices in the 
class as follows: if a = 1, then every row which has a one in column 
p is adequately represented by column p, and needs not be considered 
subsequently. If « # 1, we still may note that these same rows are 
represented once by column p, and thus need be represented only ae 1 
more times subsequently. Furthermore, we have made a "decision" 
about column p, namely that it is included ina minimal a-set of all 
matrices in this sub-class. We may thus reduce the dimensions of all 
matrices of the sub-class by one column, and (if a= 1) by a number of 
rows. If « # 1, we will keep track of those rows which yet need 
only © = 1 representatives. Hence we have for this class a vector, 

C, whose components are either a, ora =- l. 
We may also reduce the dimensions of the matrices in the other 


sub-class by one coiumn, for we have made a "decision" for this sub-class 


We 





namely, that no a-set contains column p, for any matrix of the sub-class. 
Hence every row of the matrices of this sub-class needs to be subsequent- 
ly represented a times, regardless of the value of a. 

Now let us estimate © for each of the two sub-classes. It is clear 
that these two numbers are both estimates of the a-width of the original 
matrix. We may actually improve the estimate for the first sub-class 
discussed, by adding one to the estimate of © for the sub-class. This 
is to account for the inclusion of column p in any a-set for any matrix 
in this sub-class. Now, it is certain that the smaller of these two 
numbers is not greater than the a-width of the original matrix. 

Let us examine the sub-class corresponding to the smaller of the 
two estimates. We may partition this sub-class into two sub-classes, 
and so forth, until finally, some sub-class will be so small as to 
contain a unique matrix whose C-width we can determine by inspection. 
Part of such a continuing procedure is represented by the tree struc- 
ture of Figure 5. 

Now at any point in the procedure, the set of junctions (Fig. 5) 
which have no lines leading toward another junction represent a 
partition of the class to which the original matrix belongs into two 
or more sub-classes. By an obvious extension of the above discussion, 
the smallest of the several estimates for the %width of the original 
matrix is not larger than the a-width of the original matrix. We 
may then faecus our attention on the sub-class corresponding to this 
smallest estimate, branching out from the corresponding junction until 
one of the earlier estimates of e€ is smaller than any of the most 
recently constructed estimates. Now let us set up a formal algorithm 


based upon the preceding discussion. 


13 








: 7 7 
one 
= 


= en bey ; 
. ae 7 ~ 


_ 


, 





_ x 
7 





G aandt J 


apnytoxa apny out 


2T NWITIOO 






*Tomal ° TOUT 


“ “Tout wy) “7Oxe |) ° Tout 





S NWN109 @ NWM109 OT NWNM109 L NWNTI09 3 
Load Tas “A Loa Tas Doatss 
QPNT OXx2 apntour C apN{ oxa apnt out 






€ NWYIOO JOSTES 
apnzout 


apnytoxa 


T NWYIO0 JOZTES 


tae 





Let the (0,1) matrix, Ac 2(R,S), be given, with dimensions 
m by ne A matrix is said to be normalized when both its row sum and 
column sum vectors are normalized, and when the elements of the matrix 
have been rearranged so as to fit the new row and column sum vectors. 
Clearly, we lose no generality by considering only normalized matrices. 
Therefore, throughout the remainder of this paper, we assume that all 
matrices and sub=-matrices have been normalized as part of the operation 
of constructing them. 

Notation will, of necessity, become rather cumbersome, and for 
that reason, we now present such notation as we shall need in this 
section. There will be certain preliminary steps which serve to de- 
crease the amount of work required in the main part of the algorithm, 
and since these preliminary steps are not always applicable, we shall 
assume that the given matrix, A e« ALR ,S), is the one with which we 
shall enter the main part of the algorithm. 

The procedure in the algorithm is basically broken into two 
parts; (1) selecting a column for inspection and deriving the two 
sub-classes corresponding to the inclusion in, and the exclusion from 
the a-set of the selected column (the "branch" portion); and (2) 
estimating € from each sub-class and choosing among all estimates, the 
smallest for the next iteration (the "bound" portion). We shall 
carry out the "bound" portion of the procedure by calculating p for 
each of the sub-classes and adding to p, the number of columns pre- 
viously included in the a=set on the current branch. Eventually we 
shall obtain a sub-class of matrices, one of whose dimensions is zero, 
and is thus, empty. Clearly, © for this sub-class is zero. We can 


make a test for completion at this point. If the test fails, we 


1 





continue the algorithm along some other branch. It can be seen that 
we shall derive an even number of different sub-matrices of A before 
we reach termination. We shall subscript these sub-matrices in the 
order in which they are derived. Associated with each of the sub- 
matrices, of course, will be a row sum vector, a column sum vector, an 
estimate of the C-width of the class to which the sub-matrix belongs, 
and an estimate of the a-width of A based upon the condition that it 
can be obtained by continuing along the branch from which we derived 
the present sub-matrix. Note that since we may discontinue considera- 
tion of one branch at any time, and return to a previously discontinued 
branch; the subscripts of the matrices which we shall derive at any 
point of the procedure bear no relation to the subscript of the matrix 
from which the derivation follows. This point will be made again 
during our step by step description of the algorithm. 

Now, we subscript every parameter associated with a particular 
sub-matrix with the same sub=script as its associated sub-matrix. We 
shall also require a "label" for each sub-matrix, and the typical 
label will be of the form a ,b,c,d,...". This label gives us the 
information that for each particular sub-matrix, every column of A 
which is present in the label has been branched upon; and those columns 
which appear unbarred are assumed to be included in the asset, whereas 
those which appear with a bar over them are assumed to be excluded 
from the set. Thus, the a,b,c,d in the example label above represent 
positive integers which are the column numbers of the original A matrix. 
One further convention to which we shall adhere; an even sub-script 
is taken to mean that the latest column upon which we branched is 


considered to be included in the a=-set associated with the sub-ematrix, 
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and thus this column number will appear unbarred in the associated 
label. On the other hand, an odd subscript is taken to mean that the 
latest column branched upon is considered to be excluded from the a-set 
associated with the sub-matrix, and thus this column number will appear 
in the associated label with a bar over it. The described notation is 


summarized below: 


A S ) has di sions Rae 
p € AR; 2 as dimen it m by 5 
D = (Top 2? ce eyg “pm. ° 


S. = (spy Snr co09 S ys 


=p 


k m 
D the minimum k such that Dat’?! aa D 1°Pi 


p 


D 
ft 


o. * the number of columns unbarred in the label of nae 


p p 


It is obvious that much information must be recorded for each of 
several matrices. Although a structure similar to that of Figure 5 
could be used, we suggest the format of Figure 6. This figure shows 
a typical matrix A, and all of the required information associated 
with this matrix. It will be convenient to suppress zero elements of 
the matrix. Note that we list the subscripts of the columns of the 
original matrix along the top, and directly below that, the order of 
subscripts for the derived matrix, an The order of subscripts for 
the rows of A. is listed along the left side of the matrix, and Ry and 
Sp are listed along the right side and the bottom, respectively. At 
some convenient point we list Py? e and the label associated with 
the matrix. 

In section four, we solve the targeting problem of section one 


using this algorithm. The reader may desire to read section four 
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concurrently with the description of the algorithm which follows. 


aot Preliminary Steps. 


Pics Lf rT < 8S the a-width does not exist. We terminate, or else 
decide to look for an a-width in which a is a smaller integer than 
that which the original problem specified. 


ree Jf Tr, 7? 43 go directly to step Sl, in the main part of the 


algorithm. 
3 10 1 2 5 (column subscripts of A) 
1 2 \ R 
3 5 
1 l l l l Ly 
2 l l l 3 
3 i i l 3 
hy 1 2 p = 2 
Dp 
5 1 1 
6 1 1 te 5 
=p 
Sy 3 3 3 5 2 _ — 
Mg sO 50508 
Figure 6 
= = = = = < —, ° 
P3. If r. CS eR ee for some k, (0 =k = mel); 


then it is evident that each "1" in any of these k+l rows must belong 
to a column in the minimal a-set of representatives for A. Therefore, 
- = 1, record 


J 
that the oe column is in the minimal o-set and delete the ue column 


in each such row, say the jth, for each j such that a; 


from the matrix. Let C = (a, ..., a) be an m-dimensional vector. For 


ch 


each kK such that any * 1 subtract one from the component of C. 
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a > aa 


When this has been done for all columns, j, deleted from the matrix, 


delete any row, i, for which C, = 0. Finally, recompute new row sum 


and column sum vectors, normalize the new matrix, and proceed to 

step Sl in the main portion of the algorithm. Now the set of columns 
that has been deleted in this preliminary step will not again be ex- 
plicitly mentioned. The reader is cautioned to remember to add 

these columns to the ®-set computed in the next section in order to 


arrive at the true a=-width of the matrix, A. 


3.2 The Branch and Bound Algorithn. 
Sl. We are givenaA € Q(R,S) which has been normalized. If C was 


not computed in step P3, let C = (a, «ee, a). Cross out column one of A. 
We shall branch on this column because it is the column with the largest 
column sum. This is an entirely arbitrary decision. We could branch 
on any column whatsoever, but it seems reasonable that the one with the 
largest column sum would be likely to be included in the a-set. A 
counterexample is easy to construct. In any case, it is now necessary 
to decide whether or not to include this column in the o-set. 

S2. Let us denote the matrix, A by is, 6, coe 6 Je Construct 
A, (= [S, 6. es 6) and label it "1". We examine the consequences of 
excluding column one from the a-set. A, is of size m, by n, (=n-1), 
and A, E RMR, »S1)- Our decision means that we still must locate C; 
representatives for each row, but that we may not use any of the "l"%s5 
in the excluded column of A. Calculate 0, by equation (2-5), and 
since no columns are unbarred in the label, let a = 0,- 
S3. We next examine the consequences of including column one 


in the G-set. Construct A, by deleting column one from A, forming a 


19 





temporary R, vector, and deleting every row for which the component- 


2 
wise difference of R and Ry is greater than or equal to the corres- 
ponding component of C. That is, if r, - ly; = C55 delete row i. 
Clearly this can happen at this step only if ae 1. We have now 
reduced A by one column and perhaps some number of rows. This re= 
duced matrix, when normalized is called Aj, and we now form the 
permanent vectors, Ry and Sj. We label this sub-matrix, "i". Po is 
calculated by equation (2-5), and since column one is unbarred in the 
associated label, e = P4 + 1. 

Sh. We must now decide along which branch it will be most 
profitable to continue. We make the decision by choosing the sub-= 
matrix associated with min [e!, eh]. Lf oa = e5? the choice is 
arbitrary. When using the algorithm for hand computation, the best 
choice is probably that matrix with the greatest number of unbarred 


columns in the associated label, that is,in this case, A Having 


2° 
made this decision, we set min ley, E5] =o, SO that the same 
branch will not be chosen again at a later stage. We proceed to step 
S5. All succeeding steps in the algorithm will be described in 
general terms. 

SS. In the preceeding step, we decided to proceed using matrix 
Ay > say, with associated label, "ny q,r,Syt,u"s a particular one of the 
k matrices thus far constructed (k= L). Since A, has been normalized, 
the first column has the largest column sum. We therefore select 
this column as the next branch point. Let us say that this column 


th column of A. 


corresponds to the v 
S6. Denoting AL by LS, 1259. oer Sim | we construct the next 


sub=-matrix, Avi, (#[5z9, S139 ees Sump)» thus finding the sub-matrix 


20 





- 
_ 


_ 
ss 


Tr 


a 


- 
— 7 
a 7 
- > -_ 
= 
7 _ 


as 





a 





a 
a aa 


= 


corresponding to a decision to exclude column one of AL (column v 
of A) from the o~set. A,,, is of size Mat by My, and Ay, 
= ARR 12 Si) ‘ 

S7. Noting that columns p, q, and s of A have been included up 
to this stage, we form a vector of row sums of included columns, which 
we shall call RS. That is, referring back to. the A matrix, we compute 
for each row, the number of ones in columns p, q, and s. Obviously, 
for this particular label, the sum cannot exceed three. Now we 
compute a test vector, RI. Let the jth component of RI be the maximum 


th component of RS). The vector, RI, gives 


of zero and c, ~ rs; (the i 
us the number of ones yet to be included in each row of the matrix, 
by some subsequent choice of columns. Since both RS and ET are 
vectors which are required only at this branch, and will thence be 
discarded, there is no need to subscript them. 

S8. Returning to our decision to exclude column one of Ay» we 
examine each component of R,,,. If any component of Rial is less 


than its corresponding component of RI, it is infeasible to exclude 


this column. We set e! =z o, but retain the label of A which is, 


k+l] k#1 


in this case, "'p,q,7,S,t,u,v". We then proceed to the next step. 
If, on the other hand, we determine that the label represents a 
feasible set of columns, that is, no component of Rie] is less than 
its corresponding component of RI, we compute Oral by equation (2-5) 
using the vector RT in place of C. Since there are three unbarred 
columns in this typical label, we set Chel - een 3. 

S9. We now construct matrix Ai 4d with label fing een uae 


This matrix corresponds to the decision to include column v of A in 


the a-set. We delete column one of A,. For every row which had a 
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"2" column one of Al > we subtract one from the appropriate component 
of Rf. If this component now is zero, we delete the corresponding 
row of A;. When the procedure is completed, we have the matrix A,,5. 
We compute Ry.» and S,,53 and finally, ef,5 which equals py,o + bh 
in this case. If, however, either dimension of the matrix becomes 
zero at this step, we proceed to step Sll, as it is possible that 
this represents termination. Pre? of course, is computed using the 
vector RT instead of C. 

S10. Let P be the k+2 dimensional vector whose components are 
the el. We find the minimum component of P, choosing arbitrarily 
in case of a tie, and use this component's corresponding matrix for 
our next path. Although an arbitrary choice in case of a tie will lead 
to solution, there are two techniques for choosing between branches 
that will probably shorten the algorithm somewhat. These are, either 
to stay with the current branch in case of a tie in which the current 
branch is involved3 or to take the branch which has the largest number 
of unbarred columns in its label. The second method is probably the 
best, but in the computer algorithm we shall use neither technique; 
branching instead on the matrix with the smallest subescript because 
of programming simplicity. Let us say we have chosen matrix Ay for 
our branching matrix. We set er = ©, and return to step S5, con= 
tinuing the algorithm. 

Sills. Since Aso is of zero dimension; Piao 8 O. Then SEED is 
equal to the number of columns that are unbarred in the label of 
Apso Now if a > cj for any i < k+#2, we have not necessarily 


found a solution, so we return to step S10, after duly recording the 


proper values for all of the parameters associated with this sub-ematrix. 
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Of course, it is now meaningless to consider row sum and column sum 
vectors. This is a minor point, since the only purpose of these vec- 
tors is in computing pe, and in deriving subsequent matrices. It is 
clear, though, that if we at a later time choose to branch upon this 
matrix of zero dimension, it is because we have found that it is after 
all, an optimal solution to our problem. 

If, on the other hand, Ef eo = ec; for all i = k#2, the unbarred 
columns in the label of Aigo constitute a minimal a-set of representa- 
tives for A, and the cardinality of this set of columns is the a- 
width of A. Thus we have arrived at a termination point of the 
algorithm. In the next sub-section, we shall prove that the algorithm 


does find a minimal %-set of representatives, and that it terminates 


in a finite number of steps. 


3.3 Proof that a Solution is Reached. 

We need to show that the algorithm does find a minimal a-set of 
representatives even though many possible combinations of columns 
have not been considered. It is first necessary, though, to show 
conditions under which the a-width exists. We have already stated, in 
step Pl, enat it a < ry» the aawidth does not exist. We now prove a 
necessary and sufficient condition for the existence of the more 
general Cecover of A: 
Theorem 3.1. 

The matrix, A, has a C=width for every vector, C, whose components, 
c; are bounded above by I; 6 


Proofs By hypothesis, c; = r;3; hence A, itself, is a C-cover for 


every admissible vector, C. For a fixed C, the collection of all 
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C-covers is thus non-empty, and clearly is finite. Then the collec- 
tion has a minimal member, and the cardinality of this minimal mem- 
ber is the Cewidth of A. 

QED 
Corollary: 

The matrix, A, has an a-width, e(a), for each integer a in the 
interval, Lf oir. 
We shall now demonstrate that the branch and bound technique 
of section 3.2 will find the minimal C-cover of A in a finite number 

of iterations. We shall further show, that the branch and bound 
technique is independent of the technique for computing a bound on 
€, under some rather simple restrictions. We shall call ¢(C), the 
C-width of the class, QR,S). Now €(c) is clearly a function of 
the dimensions of the matrices in the class. Let p*(C) be an esti- 
mate of €(C) such that o%(C) = €(c), and such that for the class “A 9 
one of whose dimensions is zero, o%(0) a= (0) = Q; mere © is the m 
dimensional zero vector. We insist in what follows that the esti- 
mating technique for computing p¥(C) be applied consistently. The 
parameter, P, described in section two satisfies the above require- 
ments on e¥(C). 

Let A be a given matrix and estimate (C) by p#(C). Then the 
C-width of A is not less than p#(C). Now construct matrices A, and 
A, as in steps S6 and S9 of section 3.2 using any column of A, say 
column t, instead of that column whose sum is the largest. Estimate 
a(G,) for each of the sub-matrices thus constructed by p¥(Cy)5 and 


P% (Co) respectively. Then the C,-width of A, is not les: than px(Cy)> 


1 
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and the Co-width of A. is not less than p#(C,) - The vectors, C,, 


2 
have components equal to the number of "1"'s yet necessary to rep- 


th th component of C 


th 


resent the i” row of A. For example, if the i 


were 2 and the selected column contained a one in its i” place, then 
the ae component of C, would be 23; but the ith component of Cy would 
be 1. Now since column t of A must be either included in, or excluded 
from the minimal C-cover of A, the C-width of A is not less than 

min [o¥(C,), px(C,)+11. We need no longer consider p*(C) as an 
estimate of the C-width of A. Clearly then, if we repeat this esti- 
mating process, using A, or Ay as a new given matrix according to 
whether px(C,) or px(C,)+1 is the smaller, we may compute two 
additional estimates of the Cewidth of A. Eventually (after a finite 
number of such estimates have been made), we shall construct a matrix, 
one of whose dimensions is zero. In that case, p% (Cop) = 0, and 

the C-width of A cannot be less than the cardinality of the set of 
columns slated for inclusion in the Cecover of A. This set of columns 
is, in fact, a C-cover, and if the cardinality of this set is less 
than or equal to all of the other computed estimates of the C-width 

of A, then it is a minimal C-cover, since we required that any esti- 
mate be bounded above by e(C). 

We refer the reader once again to the scheme illustrated in 
Figure 5. If each branch of this tree were to be taken to its termi- 
nation (at worst, the point at which each column of A had been tested 
either for inclusion or exclusion), each such terminal could be 
represented by an n-tuple as follows: let the ae component be one 


th 


if the i~ column had been included on this branch, and let it be 


zero otherwise. There are 2” unique n-tuples, hence at most 2” 
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corresponding terminals, each attainable in a finite number of steps. 


Hence the algorithm must terminate in a finite number of steps. 


4. Manual Computation with the Algorithm. 


Let us return to the targeting problem described in section one, 
and solve this problem to illustrate the use of the branch and bound 
algorithm. We reproduce the matrix of Figure 3, as Figure 7 for 
ready reference. Zeros have been suppressed, and we have appended the 
components of R and S to the right and bottom of the matrix, respectively. 
Figure 8 depicts the normalized matrix, A. We have appended the original 
column subscripts above the matrix. 

In this example, = 13 and it should be noted that in general, 
increasing %, significantly increases the complexity of the manual 
algorithm, because of short-cuts used in deciding which rows may be 
deleted. These short-cuts are not available fora > 1. The RI vector 
need not be constructed, since its components could only be zero or 
one, and such a simple vector can be handled by inspection. However, 
the shortecuts cannot be conveniently programmed, so the computer 


version of the algorithm can handle differing o's with almost equal 


facility. 

eee es) ee en ©) (© R 
A 1 L 2 
B l l 2 
c 1 i 2 
D i 1 2 
Ee 1 1 2 
rE 1 1 2 
G l l 2 
H l l 2 
S l 2 r 2 i l 2 2 l 


Figure 7 
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A l 2 
B l l 2 
C l l C 
D l l 2 
E l if es 
oy 1 l 2 
G l L 2 
H L l 2 
S Ly 2 2 2 2 it l l li 


Figure 8 


We show, in Figures 9 and 10, the matrices A, and Aj, respectively, 
derived from A as follows: We delete column 1 from A and since this 
column corresponds to column 3 of the original matrix, we label A> 
n3M, Naturally, we have normalized both R, and Sy° Now locate each 
row of A which has a "1" in the first column. Delete this row, delete 
column 1, and we now have Ay; after normalizing Ro and Sos This 
criterion for deleting rows is a simplification of computing RIT, which 
is the short-cut mentioned at the beginning of this section. The 
rows deleted in the example are rows B, C, G, and F. We label Ags 
LL 

\ 

For the matrix, Ay» op, = lh, since ae xm, = 8. Similarly, 

for A,, % * h, since yy So; = Mo = i. Hence c = Oe hs and 


+l «5, 


is] 
t = ‘ 
“5 Po 


ae 





l 1 


set et =o, Coltmn | of A, corresponds to column 2 of the incidence 


matrix, and is the column which has the largest column sum. 


Min less elk = €' so we choose to branch on matrix A,. We 


A 1 l 2 
D l 1 2 
E l l Ma 
H l l 2 
BL l 
C l L 
r L 1 
G l l 


Say re 2 2 2 l t l l 


Subematrix, A,- ee p 


Figure 9 


A ol 1 e 
D l l 2 
E 1 l 2 
H l l 2 
S, 1 f 1 l tl i 1 l 
Subematrix , Age "3" Pee les ef « 5 


Figure 10 
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We delete column one of A. and thus have matrix A_, which we 


l 3 
label "3,2". This sube-matrix is shown in Figure 11, below. Since 


row B of A, has row sum Zero in Ay; and since there are no included 


1 


columns in the label, this represents an infeasible set of column 


exclusions. Therefore, without further consideration, we set eS m ©, 


~ 


3 
ee l 2 
E l l 2 
H 1 l 2 
A 1 l 
C 1 1 
F l 1 
G l 1 
33 2 2 2 1 1 1 l 

Sub=-matrix Az. "3,01 . = © 


Figure 11 


Next, we delete each row of A, which has a "1" in the first 


column, namely, rows A and Bs and we delete the first column of Aye 
This gives us matrix Ay depicted on the next page in Figure 12. Of 


course, the label for A ig 43,2". Now >. S157, = 63 so oy x 3, 


i=] 
and “Th = 4) + 12). Note that the sum of column ) of A, goes to zero 


te so we may delete it. 


Now min le.» Ef 


matrix Ay, We set a = ©, and choose column one of A, for examination. 


\ 


, eh» ej] = ay = ), Hence we choose to branch next on 


ey 





oO. 


nO QO 


mn 


2 


2 


Sub-matr ix Ay. Wee 


8 5 C6 9 
3 4H 5 6 R, 
l 2 
l 2 
l u 2 
i 
l 
l i 
2 l L l 
Operas ete ly 
Figune: 12 


Deleting this column, which corresponds to column ) of the 


original matrix, produces Aes with label "3,2,4". This sub-matrix is 


reproduced in Figure 13, below. 


gone to zero in Age 


Gi Fh | ee 


S 


5 


Me 


Sub-matrix Age 


Note that the sum of row C of Ay, has 


Oe? 
Sr R. 
1 2 
1 2 
l 
1 
1 L 
l l 1 
"3,2," on = 
Figure 13 
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This means that the label represents an infeasible combination of 
columns, and we therefore set ey = © without further consideration of 
this sub-matrix. 

Now we derive sub-matrix Ag by deleting rows D and C from Ay 
since each of these rows has a "1" in column one of Ay We also delete 


column one of A) and give this sub-matrix the label, Wye". LHe 


2 

matrix is presented in Figure 1) below. Since » i ee hi, 
i=] 

we have that po, = 2, and = |, since there are two unbarred columns 


in the label for A. Clearly 74 = min fer (i=l,..., 6)3 so we choose 
to continue along this branch. Thus Ag will be our next branching 


matrix. 


6 
E l l 2 
H 1 1 2 
iJ 1 1 
G 1 l 
36 : 2 N l 
Sub=-matrix Ag. "3,2, 0 P6 = 2 7 = } 
Figure ll 


We derive sub-matrix AL from Ag (see Figure 15) by deleting 
column one of és corresponding to column 7 of the incidence matrix. 
This sub-matrix has label, "3,2,4,7". Once again we run into the 
situation of a row sum going to zero, so this sub-matrix, too, repre- 


sents an infeasible combination of columns. Therefore, we set e — ., 


and proceed. 
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fi 
H l t 2 
G l l 
E l l 
S 
7 a S l 
Subemat rix Ane We alse Eo =3 
Figure 15 


By the deletion of rows E and F from Ags we arrive at sub-matrix 
Ag, which also has column one of A, deleted. This two by two matrix 
is displayed in Figure 16, below. The label is "3,2,h,7", and we 
note that $8.1 "Mg * 23; therefore, oa. e 1, and since there are three 
unbarred columns in the label, Ef =}, Note also that the column 
sum of column 6 of the original matrix has gone to zero, so we delete 
that column also. We see that ef = min [et] Gi=t, .«s, 8), so we 
branch on matrix Ag. We remember to set EB =o , and choose column one 


of Ag for examination. This column corresponds to column 8 of the 


original matrix. 


8 o 
1 2 R 
8 
H l 1 2 
G 1 1 
S 2 1 
8 
Sub-matrix Ag Li ny a Pe = } eR = ) 
Figure 16 
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Deletion of column one of A, gives us a one by one sub-matrix 
which has label "3,2,h,7,8". This matrix represents an infeasible 
combination of columns since row G has vanished. Thus we set 
EG m o, and proceed. 

We see immediately that we have reached termination, since the 
matrix Aro is of zero dimension and has label "3,2,h,7,8". This 
means that ae = }} and that ah = min [eq] (10, <.., 10)% 

The l-width of the incidence matrix is h, and a minimal l-set of 
representatives for the incidence matrix is the set of columns, (2,h, 


7,8). These, of course, would be the station numbers that were to be 


targeted in our original problem. 


laws 
5. Computation of €(4). 


We notice that for the very simple problem presented in section 
four, ten matrices had to be written down. The writer has observed 
that in hand computation, one matrix can be used for deriving only 
two or three sub-matrices before the paper becomes impossible to read. 
Even a small matrix requires a considerable amount of time to write 
down, especially when normalization cannot be done in one's head. In 
the computer version, due to limited storage space it is necessary 
to recompute a matrix each time it must be used, so even at high 
digital computer speeds, it would be desirable to reduce as far as 
possible the number of matrices that had to be examined. 

Unfortunately, for the computation by hand, little can be done 
to simplify the problem, but in the case of the computer algorithm, it 
is possible to compute “e( a) exactly at little expense in time. Un- 


fortunately, this computation will be useful only when ao 1; and 
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hence, when the components of the RI vector of section three can be 
only zero or one. But this case is the one which is of greatest 
interest, and it is thus very worthwhile to study this computation. 

There are at least two derivations possible. One, which is 
entirely combinatorial in nature, gives considerable insight into 
the class, “/{(R,S), at the expense of being quite lengthy and not 
very intuitive. The interested reader is referred to Fulkerson and 
Ryser. [lh]. 

We shall use a network derivation which is considerably shorter 
and more intuitive. The procedure for a= 1 is outlined in [2]. It 
should be noted that the formula was first derived using network cone 
siderations. We require the following theorem in the network deriva- 
tion to follows 
Theorem 5.1. 

Let A c« Y(R,S) have a-width, c(a). Then there is at least one 
matrix, Aus in YU(R,S) such that the first e« columns of A constitute 
a minimal a-set of representatives for A. 

Proofs Consider any matrix, A « AL(R,S) with a-width, e(a). 

Let Ex be that subset of the columns of A consisting only of the members 
of the minimal a-set of representatives. If E* is the first e(a) 
columns of A, thenA = A.. Therefore we assume that column p is the 
leftmost column of A not in EX. Now locate column k such that column 

k is the rightmost column of A in EX. 

Let Re = (Trays coos Tom) be the vector of row sums of EX. Now 
if r,, = a and there is no a> O for which 24 = 1, (i=l, 2-2, m), 
we may replace cclumn k by column p in E*, and the new columns of E* 
are a minimal a-set of representatives. Suppose therefore, that we 


ha@verfor some i, r., #a , Bip = Q and ay © le We-e€all a...a critical 


Ei k ik 
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one of Ex. Then since me ai S there must be an aa = 1 for which 


kK? 
an. ® O. (j7#1). Further, for each critical one in column k, there 
is a distinct one in column p with a corresponding zero in column k. 
We perform interchanges on such critical ones, the typical interchange 
resulting in ve = 1; ain = Q; aie = O3 and a = 1, We may now 
replace column k by column p in E* and the new columns of E* form a 
minimal a-set of representatives. 

Clearly this construction is possible for each column to the left 


of column €, which is not in E*. Hence the construction yields 


A. © WU(R,S). QED 


5.1 =A Supply-Demand Network. 


In this section we shall consider directed networks. Let N 
represent the set of nodes of a network and QM represent the set of arcs. 
We denote an arc between x and y, members of N; by the ordered pair, 
(x,y) and assert that the notation implies the arc is directed from x 
to y, and is not the same arc as the one denoted (y,x). We associate 
with each arc in @, a non-negative function c(x,y) called a capacity 
function, and a non-negative function f(x,y) called a flow function. 

We associate with some nodes in N a non-negative function a(x) which 
may be thought of as a supply of some commodity available at node x, 
and we associate with some other nodes in N, a non-negative function 
b(y) which may be thought of as a demand for some commodity by node y. 

We make use of the following shorthand notation, Let S, T, be 
subsets of N, and let x,y be elements of N. Then by c(S,x) we mean 
Be c(s,x), and similarly for f(S,x). Also, by c(S,T) we mean 
ses ter c(s,t), and similarly for f(S,T). Analagous shorthand will 


be used for the functions, a and b. 
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Now let us assume we have a class of matrices, “(R,S). We 
devise a network for this class as follows: Let there be n nodes 
denoted by, ..., b,3 with demand function, b(b,) = s,, the fas 
component of S. Let there be m nodes denoted Ays coos aS with supply 
function a(a;) “Ty the ‘a component of R. Let B = D33 A *a,. 

Let (a; 5b;) e@for all i, j. Let c(a;,b,) wl forall i,j. This 
network has an arc capacity of one for each of the men elements of a 
matrix, A « Y(R,S). The commodity available at the nodes of A, and 
required by the nodes of B is, of course, "1""'s to distribute among 
these men elements of the associated class of matrices. We construct 


a flow in the network satisfying the following constraints: 


(6) f(x,N) - £(N,x)= a(x) xe A 
(7) f(N,x) = f(x,N) = b(x) x € B 
(8) O =4f(x.y) = (x.y) (xy) e @ 


Clearly this construction is possible. For R = 2,2,2,2) and 
S =(3,3,2) such a network has been constructed in Figure 17. The 
number by each arc is the value of the flow function for that arc. 
Now let us construct the corresponding matrix (Figure 18). If 


f(a;,b;) « 1, let a;, = 13 if f(a; 5b;) = 0, let ay; * 0. This matrix, 


2 
A, is.in the class, AS(R,S). Furthermore, each unique feasible flow 
corresponds to a unique matrix, A € Y{(R,S), and conversely. 

Then let us ask this question: under what conditions can we 
construct a flow so that f(a;,T)Za for each i, and forT = (by, coe, 
b_}? This flow would correspond to distributing at least © ones 
from each a; to the nodes corresponding to the first € columns of a 


matrix in ALR,S). If we can locate the smallest € for which this 


flow is feasible, we shall have found pee See Theorem 5.1. 
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Since vm = ye for the demands to be satisfied at 
each oe the supply at the etaee of A must be totally exhausted for a 
feasible flow. Consider the network of Figure 19, with four sink 
nodes (those with positive demand functions), and three source nodes 
(those with positive supply functions). Let us construct for each 
e =n; a network as follows: Let B. be the first ¢ nodes of B. Let 
(x,sy)e @ for x cA andy ¢ B_. Now construct m new nodes 
(a}, ee at) and let the set of these nodes be called At. Let 
(a; are @ for each im. Let (x,y)e @ for xc A' andy e Bw B. 


(the relative complement of B with respect to B). Let the following 


be true: 
a(a,) <0 i=). gees we 
b(d,) aS Se ae 0 
eter?) = c(ai sb, ) - ‘ " 
k =e+l, re 
c(a;,at) i PEL, «xs hh 


The construction corresponding to the network of Figure 19 for 
€ = 2 is shown in Figure 20 on the preceding page. Numbers above 
each arc are the capacities of the arc. What we have done is this: 
since the capacity of each (a;,at) is a units less than the supply at 
a;, at least 2 units of supply must be distributed to the nodes of Bes 
We call a flow feasible if and only if constraints (6), (7), and (8)) 
are satisfied and if 

(9) f(x,N) = f(N,x) = 0 xe At 
is also satisfied. Clearly, the smallest e« for which a feasible flow 


ct 
exists in this type of network is e(a). 


By 








> 








a 





ane 
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Theorem 5.2. 


The constraints 


(6) f(x,N) = f(N,x) = a(x) xed 
(7) f(N,x) - £(x,N) = b(x) ceo 
(8) 0 =f(x,y) = ¢(x,y) (x,y)e @ 
(9) f(x,N) - f(N,x) = 0 xo Pena 


where a(x) =0, b(x) = O; are feasible if and only if, 
(10) (BX) = aA X) E c(X,X) 
holds for every partition of N into subsets X and X (= N~ xX). 
This is the well known supply-demand theorem due to Gale. A 
proof may be found in [2]. 
We apply (10 to our network (in general) and observe that for 
partitions of the forms 
X = fa,, Pr cl ae 


m 1? o@e@e8 9 


a» {ot te 
X fat sees ATS Dea? 


als Dis cess bos Deap> tees bi} and 


oes 9 de} 
where e and f are integer parameters satisfying 


i 


[lA 


0 


IA 


e —*n; € n 


ITA 
I} A 


(10) is of the forms 


(it) «& + coo tS S(ri., - a) er + (9 -a) + e-(f = ¢) 


e+] 
The validity of the inequality is obvious except possibly for the 
term, e*(f - €). This term is merely the number of at in X, times the 


number of b; in X3; and is the total capacity of all arcs connecting 
these two sets of nodes. 
Theorem 5.3. 

The constraints, (6), (7), (8), and (9) of Theorem 5.2 are feasible 
for a network of the type of Figure 20, and for fixed €, if and only if 


(11) holds for all permissible values of e and f. 


oO 





The proof of this theorem consists of looking at all subsets of 
nodes not of the form on the preceding page, and verifying that Theorem 
5.2 is valid for these subsets if it is valid for the subsets of the 
above form. Since the proof is not very interesting, and rather 
lengthy, it is omitted. 

Theorem 5.3 assures us that we need not test all subsets of nodes 
with (10) in order to assure ourselves that we have a feasible flow. 
Now let us multiply (11) by minus ones; and rearrange terms: We arrive 
ats 


+ oo tr = (sS 


(12) r ” 


tout S_)+ e*(f=c) = a«(m=e) 


e+] etl 


We thus have the condition that © is the smallest integer for which 


(12) is satisfied for all integer values of e and f in the ranges: 


A 
WA 


[A 


0 £ 


A 


e@e=m E n 

Now the left side of (12) is the class invariant which Fulkerson 
and Ryser call N(c,e,f). [lh] For ease of computation we define a 
function, Q(e,e,f) = N(e,e,f) -a «(m= e). € is the smallest « for 
which 

(13) 0 0G 3c.) e=.0 0j¢ lm; Se a 
We take first differences with respect to ce, e and f3 and derive the 
following recursion formulass; 

(1h) Q(e+1,e,f) = Q(e,e,f) + Sey, - @ 

(15) Q(-,e*1,f) 


(16) Q(é,e,f+1) = Q(e,e,f) +e - Sea] 


Q(c,e,f) + fta-e- lo] 
Since we know that P is a lower bound on the a-width of any 

A « QJ(R,S) we may take « = p, and compute an mtl by n-e array 

making liberal use of (15) and (06). If one of the numbers is negative, 


we increment € by one using (1h), and compute the m4+l by n-ec array 


1 





for this new value of €. When an array is found which contains no 
negative members, we have found oS 

Now, aS we mentioned before, we do not advise this procedure for 
hand computation, and it cannot be used for a greater than one, but 
the case a= 1 is the most important case by far, as will be seen in 
section seven, and a digital computer is admirably suited to perform 
these simple arithmetic computations. In the next section we shall 
discuss the computer program in which the above formula was used: 
and the results obtained with a large number of matrices. 

6. The Algorithm Program. 

We present a procedural flow chart for the branch and bound algorithm 
in Figure 21. AS much as possible of the procedure is described in 
abbreviated, but intuitive language. Where variable names are neces- 
sary they are either the names given to the same variables used in 
Sections 3.1 and 3.2, or they are defined on the flow chart itself near 
the point at which they are first used. Variable names are used that are 
in reasonable agreement with the corresponding names used in the computer 
program. 

The algorithm was programmed for the Control Data 160) computer 
using FORTRAN 63 source language, and CODAP 1 assembly language. The 
assembled program is included in this paper as Appendix I. There are 
several features of the program which deserve discussion. 

Since we are dealing with matrices composed of zeros and ones, 
storage space can be conserved by letting a single bit represent an 
element of the matrix. Then we may use logical operations to manipulate 


the matrix. Normally this would require writing the entire program at 
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the assembly language levels but we avoid this by taking advantage of 
a capability of FORTRAN 63 which permits the programmer to define his 
own type of arithmetic. 

The CDC 160) word size is 48 bits. We chose to write the program 
to accept matrices up to dimensions ih by 14. We store a single row 
of the matrix in three consecutive computer words; hence an entire 
matrix requires only 32 words of storage. The first word of row i 
contains elements asi through 231.83 the second word contains 2219 
through 85063 and the third word contains a, 


i97 
We define, according to the rules of FORTRAN 63, a TYPE LOGICS 


through As ihh ° 


arithmetic in which an elemental word consists of three consecutive 
words of memory. We call such an elemental word a TYPE LOGICS word. 
Thus, one LOGIC5 word is equivalent to one entire row of a matrix, or 
other variable which needs to be three computer words in length. For 
instance, we shall require several masks with which to derive the various 
sub=matrices, and each such mask must consist of three computer words. 
We shall need to take logical sums and products, to complement 
words, to clear words of ones in certain bit positions; and we shall 
require a method of generating a 1 in any of the Lh bit positions of 
a LOGICS word. We define, through the subroutine, QLOMATH, the symbol 
t+ to mean logical sums the symbol "*" to mean logical products; the 
symbol "=" with two arguments to mean “set the it? bit of the first 
argument to zero if the yth bit of the second argument is one": and 
the symbol *—" with one argument to mean "complement the argument". 
We also define "ARGUMENT /j" to mean "set the yeh bit of the argument 
to one, and all other bits to zero, cointing from the leftmost bit 


position of the argument". 





The only requirement we have for generating the sub-matrices of 
the given matrix is that we must be able to compute the corresponding 
row sums and column sums for use in estimating ©. We may compute the 
row and column sums in the computer program without deriving each of 
the sub-matrices through the use of suitable masks. We require two 
such masks; one is a mask of columns upon which the program has already 
branched; and the other is a mask of columns chosen for inclusion 
in the minimal a-set at the current branch. In each case, a l in the 
ae bit position of a mask indicates that column i is a member of the 
mcttor columns which the mask represents. 

Almost all arguments used inthe various subroutines are stored 
in COMMON. This decreases the computation time at the expense of re- 
quiring difficult to follow indexing of the parameters. Most such 
parameters are stored in an array, IDATA. This array is really three 
consecutive arrays of parameters associated respectively with the 
matrices Ar» Aoxs and Aone of Figure <1. The correspondence between 
IDATA and the mnemonic variable names may be found in the EQUIVALENCE 
statement near the beginning of the program. 

The masks and bound of all matrices must be retained in storage, 
but other parameters, (row sums, column sums, dimensions, etc.) are 
recomputed each time they are required. If a random access storage 
device (such as a magnetic disc) is available a savings of computation 
time would result from the storage of these parameters. 

Up to 2000 sets of parameters can be retained in core storage 
Simultaneously. When this limit is reached, the section of the program 
from statements 192 to 193 searches for any sets of parameters no longer 


required, discards them and compresses the remaining parameters into the 
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front of the storage area. This effectively increases storage space 

up to the point at which there are 2000 current branches of the algorithm. 
(Current branches are those branches for which the corresponding esti- 
mate of € is less than "infinity".) 

Sample output is shown in Figure 22. This matrix can be recognized 
as the incidence matrix of the communications network discussed in 
Section one. Note that the matrix is printed in octal format which 
must be converted by hand to the proper (0,1) form. Each digit of the 
output represents three elements of the matrix; for example, the digit 
"Ott represents the three elements "1, 0, 1". 

Short, but descriptive comments separate major sections of the 
program listing by tasks, and introduce each of the subroutines. The 
various CDC 160); instruction manuals and programming manuals may be 
consulted for further information. 

The program is not very efficient in its present form: many pro- 
gramming conveniences such as the use of TYPE LOGIC5 arithmetic, and 
the use of subroutines, makes writing of the program simpler at the 
expense of generating many otherwise unnecessary instructions. As a 
first step toward improving the efficiency, the author recommends 
elimination of TYPE LOGIC5 arithmetic, substituting in its place, 
CODAP1 subroutines to perform the necessary substitute operations, and 
using direct calls to these subroutines in place of the operations 
symbols. In addition, it is recommended that all present subroutines 
written in FORTRAN 63 be incorporated into the main program. Program 
space is not critical in a computer the size of the CDC 160, and by 
writing the subroutines as part of the main program, advantage may be 


taken of task specialization. For instance, subroutine ROWSUM computes 


sh 





the sum of all M rows of the matrix each time it is called. It takes 
as much time to compute a row sum which is zero as one which is not; 
but we have information which could be used to specialize the routine 
so that it skips over rcews whose sum is zero. 

A still better technique would be to write the entire program 
at the assembly language levels especially if the user intends to 


use the program for more than the solution of a few matrices. 


6.1 Results of Using PROGRAM WIDTH. 


If a matrix has o-width, ec, and we were to attempt to find the 
a-width by looking at all possible sets of a columns, then all possible 
sets of 4 + ] columns, and so forth up to all possible sets of € = l 
columns and finally some sets of © columns, we should have to look at 


X sets of columns for 


Eo] .€ 
Se ee hae ee 


k=a 
We should have to look at this number of sets of coluUmns using the 


branch and bound algorithm also, if all of the estimates of «¢ which were 
current turned out to be equal. it is conceivable that this could 
happen for some problems hence we must take (17) as an upper bound 
on the number of branches which must be investigated by the program. 
Now the branch and bound algorithm is not the most efficient way to 
search subsets of columns, so we are quite interested in determining 
just how far below the upper bound we can stay by branching and bounding, 
Since we cannot express ary theory to demonstrate the efficiency 
of the algorithm, the only anoiae open to us was to solve many problems 
of varying sizes in hopes that trends could be established. It is for 
this reason that subroutines RANDOM and RANDGEN were added to the pro- 


gram. These two subroutines generate matrices of any size up to 1h 
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by 1h. A uniform random number generator is used to generate three 
consecutive random numbers which represent one row of a matrix of lh) 
columns. If a matrix of N columns is desired (N < 1h) bit positions 
N+ 1, ..., lll of the three word element are set to zero. The number 
of ones remaining in the three words is computed and compared to a user 
supplied argument, NONES. If NONES is less than the remaining ones in 
the three word element, another set of three words is generated, the 
appropriate bit positions cleared to zero, and then the logical product 
of the two elements is taken. This procedure is repeated until NONES 
is greater than or equal to the number of ones remaining in the three 
word element. Thus NONES represents the maximum permissible row sum 

of any row in the matrix. The three word element is then assigned as 

a row of the matrix, and the procedure is repeated until an entire 
matrix has been generated. We are thus reasonably sure of a random 
distribution of ones throughout the matrix, and we have some control 
over the density of ones in the matrix. Matrices of any dimension are 
generated in no more than a few seconds. 

Our original plan was to generate and solve five matrices of each 
of 112 sets of dimensions for the matrix. It was felt that such a set 
of matrices would be a statistically significant sample from which 
computation time could be functionally related to such parameters as 
matrix dimensions. Unfortunately, time has prevented the completion 
of this scheme. Hence all remarks that follow in this section are with- 
out statistical significance. 

We have been able to generate and solve over 200 matrices of vary- 


ing dimensions for their l-width; one being by far the most important 
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value for % Dimensions of matrices generated were from the 8 by 9 
problem of section one to matrices of dimensions 1h) by 25, and 35 by 
100. Some relatively square matrices of size 50 by h5 are included. 
As is to be expected, computation time varies directly with number of 
branches considered when matrix dimensions are held constant. Let us 
therefore make some remarks about the number of branches considered 
by the program in solving these matrices. 

It is clear that the number of branches is a function of the 
number of columns and of the actual l-width of the matrix. Not quite 
so obvious is that the number is a function of the number of rows in 
the matrix. However the fluctuations apparent in the number of branches 
is very wide. For instance, for one matrix 260 branches were taken 
while for another, 1536 were taken. Both matrices were of dimensions 
35 by 35, and had a l-width of seven. It is apparent that other 
factors must be involved. One such factor is the distribution of ones 
in the matrix. One matrix, a Steiner triple system [5], which is a 
matrix which among other properties has all row sums equal and all 
column sums equal; required investigation of 1216 branches before com- 
puting the l-width as nine. Yet this matrix had only 35 rows and 
15 columns. The symmetry of the matrix made it difficult to weed out 
unprofitable branches. Another matrix of dimensions 50 by 5 exceeded 
the capacity of the program storage after 218) branches. In every 
case, however, the number of branches were below the upper limit given 
by (17). Values seldom exceeded 600 for any of the matrices. 

Of more practical interest is the time required for computation. 


The CDC 160) has an effective cycle time of ).8 u sec. The longest 
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time required to solve any problem was 36 minutes, although there were 
problems which had not been solved when the program was stopped by the 
operator after about 5 minutes. The matrix which required 36 minutes 
was of dimensions 50 by 5 and had a lewidth of six. The relationship 
between branches and computation time is rather interesting. Matrices 
of dimensions 125 by 25 required a little over one second per branch 
whereas matrices of dimensions 25 by 120 required between two and three 
seconds per branch. This seems to verify that advantage could be gained 
by eliminating the LOGICS arithmetic in favor of more efficient methods, 
since the amount of LOGICS arithmetic required increases with number of 
columns. 

Computation time was graphed on semi-log paper versus 1) number 
of columns, 2) number of rows, and 3) lewidth of the matrix. Figure 23 
is a graph of time versus number of columns for matrices of 25 and 35 
rows. Figure 2) is of time versus number of rows for matrices of 25 
and 35 columns, and Figure 25 is of time versus l-width for matrices 
of dimensions 20 by 20. 

From these graphs, it seems reasonable to conclude an exponential 
increase in computation time versus beth number of rows and number of 
columns. No hypothesis is made about the parameters of the function. 
Our method of generating matrices degrades the validity of Figure 25. 
In order to create matrices of high lewidth, we can oniy lower the den- 
sity of ones in the matrix. This, in turn, increases the likelihood 
of rows of sum ones which results in an artificial simplification of the 
problem. This is apparent especially in the case of the matrices of 


lewidths nine ami ten in Figure 25. 
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The validity and usefulness of the algorithm has been established 
by the above results. Most computation times were under 20 minutes, 
and it is felt that computation times could be reduced more than 25% 
by cleaning up the program and doing without the pregramming convenience 


of TYPE LOGICS arithmetic. 


7. Applications and Extensions. 


In this final section we consider applications of the branch and 
bound algorithm to solution of real-world problems, and certain famous 
problems of the mathematical puzzle category. We shall also propose 
certain extensions of the algorithm as presented in section three, which 
enlarge the class of problems which may be soived. Certain of the 
problems may, indeed, be more easily solved by other methods, but they 
are presented here to illustrate the variety of problems which may be 


formulated in terms of finding the C-width of a (0,1) matrix. 


7-1 The Eight Queens Problem. 


A famous mathematical puzzle is the followings place the maximum 
number of queens on a chess board so that no two may attack each other. 
We construct a graph of 6); nodes, one for each square on the chessboard. 
Connect two nodes if a queen may move from one node to the other. The 
minimal l-set of the node-arc incidence matrix is a minimal set of 
nodes that touch all arcs of the graph. Now since this matrix has row 
sums which are all equal to two, the sub-matrix consisting of all columns 
not in the minimal l-set has row sums of at most one. Hence in the 
graph corresponding to this subematrix, there is no connection between 


any of the nodes. Thus the complement of the minimal l-set of nodes 
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represents square at which the maximum mumber of queens may be placed 
so that no two may attack each other. This problem is a special case 
of a class of problems which also includes the targeting problem of 

Section one. We next present a description of this general class of 


problen. 


tee Lhe Connecting Nodes Problem. 


Find the fewest number of nodes that touch all arcs in a graph. 
Here the rows of the incidence matrix are arcs of the graph, and the 
columns are nodes. The i-ewidth of the incidence matrix is the solution 
to the problem. In addition to the targeting problem of Section one, 
another problem of this type is the following: 

Given a communications system of some type (let us say a system 
of highways connecting towns), what is the minimum number of arcs (high- 
ways) which must be kept safe from attack (natural disaster, etc.) so 
that no node is iseclated? In order to solve this problem, we construct 
an incidence matrix as follows: list the arcs as columns and the nodes 
as rows. Let ai; = 1 if the it? node is a terminal of the gth ATC o 


Then the lewidth of this matrix is the solution to the problem. 


led Simplification of Logical Functions. 


We present this problem to show the range of interpretations which 
may be made from the concept of a-width. The reader is cautioned, 
however, that the incidence matrix required for using the branch and 
bound algorithm to solve this problem is likely to become prohibitively 


large. 
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Given a truth table for a proposition letter formula, F, inr 
proposition letters, Pps ees P,3 find a disjunctive normal form for 
F which has the fewest number of terms. If we let "&" represent the 
conjunction operator; "+" represent the disjunction operators and a 
be the negation of a, then the disjunctive normal form for a proposi- 
tion letter formula is of the forms: 

(Pp, & Po) + (p, & p3 & P)) + (Pp, & Pp) + --. 
where the p; are proposition letters. The expressions enclosed within 
parentheses are called terms. The problem is of interest in switching 
circuits and in the logical design of digital computers. 

For columns of the incidence matrix take all terms having one 


of the forms: q33 4; & q33 woof GT, & eee & q.3 where q. is either P;: 


1 
or its negation; and such that the term takes the value "true" only 

if F does also for all values of the P. not explicitly present in the 
term. For example, if Py & P3 is a term of F in three proposition 
letters, Py» Pos P33 then both F(P) »PosP3) and F(Pys Pps P,) must be 
true if P, & P, is true. We next construct a row of the incidence 
matrix for each "true" entry of F in the truth table. Place a one in 
the column corresponding to the assignment of values to Pye eres PL 
which makes up the entry in the truth table corresponding to the "true" 
entry of F. Then place ones in all other columns which are also true 
for this assignment of values to the p;- Thus if P, & Po & P, makes 

F true, a row of the matrix would have a one under this column label 


as well as under Py & Py3 P & P, and so forth. 


i 

As an example, consider the truth table of Figure 26. The columns 
of the matrix would be labelled "p, & py"; "p, & P"5 "Do & p35 P)&Po8P3" 
"Dy & Po & P3"5 "Fy & Po & P3"3 "Py & Po & P's The four rows would 


have ones in columns labelled as follows: 


6h, 


- 
oe 


a — 
a 
a 7 
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Figure 26 


Row 1: Py & P33 P, & P, & P 
Row 2: p, & Po3 P, & P33 Py & p, & p 


2 é 


Row 3: p, & Py} BD, & P, & P, 
3 Py & P33 Py & Py & P, 


and Py & Py are a minimal l-set of representatives 


- 
3 


3 
for the matrix, and F = (p, & py) 4 (P, & P,) is a minimal disjunctive 


Row lis P, 


Clearly, Py & Dp 


normal form. 


7-4 The Minimal C-cover Problem. 

It would be quite simple to extend the computer program to solve 
the minimal C-width problem. Essentially all that would be necessary 
is input revision to accept the vector, C, and the initial setting of 
the vector, RI toc. Of course, © could not be calculated, but would 
have to be estimated using exactly the same subroutine which is presently 
used for the situation, © > 1. For an entirely different algorithm for 
solving the minimal C-cover problem (and hence also the a-width problem) 
see [8]. The minimal C-cover extension is of interest primarily as a 


first step to a more involved and more useful extension. 
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7-5 A Minimal Cost Cecover. 

One of the more obvious deficlencies of the solution to the 
targeting problem of section one is that when only one leset is computed, 
that particular set might include a target very heavily fortified 
whereas one not as heavily fortified might have been a member of another 
minimal Geset. One approach to remedy this deficiency would be to 
compute all minimal a-setss: and indeed the approach will be mentioned 
subsequently. However, it is also possible that for some variety of 
reasons, it would be preferable even to destroy more than the minimum 
number of targets. The term, preferable, indicates that there might 
be a utility function or cost function associated with the problem. 

The extension of the algorithm so that it may handle costs 
associated with the columns is perhaps the most interesting extension 
that we shall discuss. The author believes that this extension might 
result in a decrease in the computation time required. The belief is 
based upon the obszrvation that the lower bounds calculated in the pre- 
sent program are relatively close to each other. Thus there is entirely 
too much switching away from one branch, to another, and then back to 
the original branch. With a wide difference among the column costs, 
however, the differences among the various estimates of the Cewidth 
of the original matrix should be equally wide. This will serve to 
reduce the unnecessary switching from branch to branch. That is, it 
is more likely that when a branch is dropped by the algorithm, it is 
because that branch has become unprofitable. 

Suppose we assign to each column of the matrix 4 cost, which need 
not be integral, and may be positive, negative, or zero. We would 


then be interested in finding a Cesover (or ana-set) which has 
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minimum cost associated with it. Of course, such a C-cover might 
not be a minimal Cecover as defined previously. 

The modification to the computer program would be surprisingly 
simple. The cost vector would be read in, and let us assume stored in 
COMMON. Now the "infinity" for unfeasible column combinations must be 
increased to some arbitrarily large number. An estimate of €(C) would 
be calculated for each sub-matrix using the same subroutine as present- 
ly used for a> 1. From subroutine BOUND, however, the program 
would enter a new subroutine, such as subroutine COST presented in 
Figure 27. In this subroutine, a cost for the p just computed would 
be estimated. The estimate would be optomistic in the sense that the 
cost for the columns would be the sum of the smallest cost components 
not already used on this branch. For example, consider a cost vector, 
(1,2,3). Assume that, on the current branch, column one has been 
either included or excluded, and that we have computed p = 1. Then 
the cost for the sub-matrix would be estimated as two; and the esti- 
mate of the cost for the minimal C-cover would be two plus the cost of 
column one, if column one had been included, or two, if column one 
had been excluded. 

Finally, either in the same subroutine, or in the main program, the 
cost of the set of currently included columns would be computed and 
stored in place of the argument, VCOL(I) of the current program. Also, 
VEPSILON(I) of the current program would be replaced by the sum of 
VCOL(I) and the cost estimate just computed in subroutine COST, as 
described in the above paragraph. 

The reader is reminded that the subroutine COST of Figure 27 
has not been checked out and that the remarks in this section about 
decreasing total computation time represent merely the author's 


intuition and are not based upon observations. 
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SUBROUT INE COST (EPSILON, I) 
COMMON /BLOC KH /CCOST ( 14) /BLOCKB /IDAT A( 1761) 
EQUIVALENCE (MASKC, IMASK(3)) 
TYPE LOGIC5 (3) MASKC, BIT 
DIMENSIONS TEMP( 1h) 
pd 5 jl, 3 
5 IMASK(J) = IDATA( (587%(I-1) )+582-+J ) 
N = IDATA( (587%(I=1) )+3) 
BIT = BIT * MASKC 
IF (BIT.EQ.0) 10, 20 
10 Ke K + 1 
TEMP(K) = CCOST (J) 
IF (TEMP(K).LT.TEST) 15, 20 
15 TEST = TEMP(K) 
Kl = K 
L = EPSILON 
20 CONT INUE 
IF (K.LT.L) 25, 30 
25 EPSILON = 1.E+20 
RETURN 
30 EPSILON = TEST 
Jel 
35 Jw J +i 
TEST 1 = TEMP(i) 
DO WS M = 2, K 
IF (TEMP(K) .GE.TEST .AND.TEMP(M) .LT .TEST1.AND.K1.NE.M) ho, hS 
hO TEST 1 = TEMP(M) 
4S CONT INUE 
EPSILON = EPSILON + TEST1 
TEST = TESTi 
IF (J.EQ.L) 50, 35 
50 RETURN 
END 


Figure 27 
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Once both of the above extensions have been programmed we may use 
the algorithm to solve a large variety of problems which are a sub- 


class of the set of integer programming problems. 


7.6 An Integer Program with a (0,1) Constraint Matrix. 


We merely point out in this section, a formulation of a problem 
which the extended algorithm can solve. Given the system of linear 


inequalities: 
n 


2 3” . b i = ] 2 eee 
ee as : i a es . mm 


where aay is either zero or one; find values for each x such that a 


IV 


is either zero or one, which minimizes: 
n 


rors 


Here the Cs are costs, and the b, are the components of what has pre- 


viously been called the C vector. 


7.7 Constraints on Combinations of Columns. 

Suppose that upon any of the problems which may be solved by 
extensions of the algorithm, we impose constraints of the following 
types if column a is included in the C-cover, then column b must be 
excluded. 

We could write a relatively short subroutine to handle this type 
of constraint. It would be necessary to put the constraints in a con- 
venient form, say for each constraint construct a mask of zeros except 
in the bit positions corresponding to columns which cannot be included 
together. For example, let there be five columns and assume two 
constraints: that columns one and two cannot be included together, and 
that columns three and five cannot be included together.Then the two 


masks would be: 
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11000 and 00101. 
We put these constraints into the program in some convenient fashion 
(probably by the same system used for putting the A matrix in the 
current program): and write a subroutine to compute the logical pro- 
duct of each constraint with the mask of included columns. If there 
are no ones in the product for any constraint, the subroutine must set 
a "current" cost vector equal to the input cost vector. If there are 
two or more ones in any single product, the subroutine must indicate 
that an infeasible column selection has been made. Finally, for each 
constraint with exactly one "1" in the product, set the "current" 
cost of every "1" in the constraint to "infinity": except of course, 
the "1" representing the current column inclusion. We use the 
"current" cost vector in computing bounds instead of the input cost 
vector. 


7.8 Finding All Minimal C-covers of the Matrix. 





It is possible to simplify the search for minimal C-sovers once 
the first one has been located. The same algorithm applies except that 
we have information to rule out as infeasible, any set of columns 
which yields an estimate of e(C) larger than the computed C-width. 
Although the simplification would contribute to a substantial savings 
in computation time for each additional minimal Ce-cover, it is believed 
that for most problems the search for all minimal C-covers would ree 


quire more computation time than the results would warrant. 
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